DOCUMENT RESUME

ED 237 399                                                  SO 015 110

AUTHOR          Moss, Pamela A.; Petrosky, Anthony R.
TITLE           A Proposal for Measuring Critical Thinking.
SPONS AGENCY    Pittsburgh Board of Public Education, Pa.; Richard
                King Mellon Foundation, Pittsburgh, Pa.
REPORT NO       T-83-2
PUB DATE        Sep 83
NOTE            23p.; Revised version of paper presented at the
                Annual Meeting of the American Educational Research
                Association (Montreal, Canada, April 11-15, 1983).
PUB TYPE        Tests/Evaluation Instruments (160) --
                Speeches/Conference Papers (150) -- Reports -
                Descriptive (141)
EDRS PRICE      MF01/PC01 Plus Postage.
DESCRIPTORS     *Critical Thinking; Definitions; *Diagnostic Tests;
                Essay Tests; Scoring; Secondary Education; Skill
                Development; *Social Studies; Teacher Education

ABSTRACT



     The diagnostic testing component of the Critical
Thinking Project of the Pittsburgh schools is described and a
rationale for the project's particular choice of testing procedures
is provided. The purpose of the project was to help students develop
skills for thinking critically and communicating critical thought
through reading, discussion, and essay writing within the social
studies curriculum. The project defined critical thinking as a
dynamic process of questioning and reasoning that reflects a
trusting, yet skeptical orientation toward the world. Critical
thinking skills are interdependent, emphasizing both oral and written
expression. Because no existing tests suited the project's
conceptualization of critical thinking or met its practical needs,
the project developed a testing procedure that asked students to read
passages relevant to their social studies curriculum and write essays
in response to questions which asked them to do such tasks as
evaluate or draw inferences from what they had read. Asking students
to state their ideas clearly and to justify or defend those ideas
with coherent explanations and evidence provides insight into the
thinking process. An analytic scoring guide which teachers can
apply to their student essays quickly to obtain relevant diagnostic
information was also developed. Included are two sample essays
followed by excerpts from the project's teacher training materials
which provide the rationale for the scores given and suggestions for
providing feedback to the students who wrote the essays. Appended are
a version of a critical thinking test and a detailed
version of a diagnostic scoring guide. (RM)



***********************************************************************
*   Reproductions supplied by EDRS are the best that can be made      *
*   from the original document.                                       *
***********************************************************************




Monitoring Achievement in Pittsburgh 

Critical Thinking Project 
Pittsburgh Public Schools 
Richard C. Wallace, Jr., Superintendent



A PROPOSAL FOR MEASURING CRITICAL THINKING 

Pamela A. Moss and Anthony R. Petrosky
University of Pittsburgh 



U.S. DEPARTMENT OF EDUCATION
NATIONAL INSTITUTE OF EDUCATION
EDUCATIONAL RESOURCES INFORMATION
CENTER (ERIC)

This document has been reproduced as
received from the person or organization
originating it.

Minor changes have been made to improve
reproduction quality.

Points of view or opinions stated in this
document do not necessarily represent official NIE
position or policy.



"PERMISSION TO REPRODUCE THIS 
MATERIAL HAS BEEN GRANTED BY



Mss 



TO THE EDUCATIONAL RESOURCES 
INFORMATION CENTER (ERIC)."



September 1983 
Report No. T-83-2



Revised version of a paper presented at the Annual Meeting of the
American Educational Research Association, Montreal, Quebec, Canada,
April 1983.

The MAP Critical Thinking Project is funded by a grant from the
Richard King Mellon Foundation and the Pittsburgh Board of Public
Education.



Division of Curriculum Development 
341 South Bellefield Avenue 
Pittsburgh, Pennsylvania 15213 



ABSTRACT 



The Critical Thinking Project of the Pittsburgh Public Schools
aims to help students develop skills for thinking critically and
communicating critical thought through reading, discussion, and essay
writing within the social studies curriculum. The project, which
began in September 1982, has three related components: instruction,
assessment, and staff development. It involves selected teachers and
supervisors from the school district in the development, adaptation,
and use of the instructional techniques, the diagnostic tests, and
the staff development itself. This paper describes the diagnostic
testing component of the project (essay tests rated by classroom
teachers using an analytic scoring guide) and provides a rationale
for choosing essay tests over alternative testing formats for
measuring critical thinking.



A Proposal for Measuring Critical Thinking



In September 1982, the Pittsburgh Public Schools began a three-year
Critical Thinking Project to develop a curriculum to teach critical thinking
skills. The project aims to help students develop skills for thinking
critically and communicating critical thought through reading, discussion, and
essay writing within the social studies curriculum. The project has three
related components: instruction, assessment, and staff development. It
involves selected teachers and supervisors from the school district in the
development, adaptation, and use of the instructional techniques, the
diagnostic tests, and the staff development itself.

This paper describes the diagnostic testing component of the project and
provides a rationale for our particular choice of testing procedures.
Although instruction is the primary emphasis of the project, we recognize that
testing can interact with instruction in some very important ways. It can
provide feedback to teachers regarding the success of their instruction in
skills for critical thinking and to students regarding their success in
mastering those skills. It can help teachers plan instruction by providing
information on their students' strengths and weaknesses. Ultimately, it can
promote instruction in critical thinking, as teachers are more likely to teach
what will be tested.

The Rationale [1]

The term critical thinking and related terms such as problem-solving,
rational thinking, and reflective inquiry are an integral part of our public
language when we discuss the purposes of education. However, there is
little agreement among educators about what critical thinking is. Definitions
of critical thinking have been proposed in the abundant literature on the
subject and invariably involve lists of "skills" or "aspects" or "steps"
which are said to make up the construct. (See McPeck, 1982, and Cornbleth,
1978, for reviews of the literature.) While there is some underlying
commonality among many of the definitions, the diversity of cognitive
processes, knowledge structures, and affective orientations that have been
proposed as components of critical thinking makes it difficult to delimit the
concept or to capture its essence.

[1] The following five paragraphs are based upon a project working paper
prepared by Catherine Cornbleth, University of Pittsburgh, and Thomas S.
Popkewitz, University of Wisconsin-Madison.

In our still evolving effort to come to grips with the meaning and
manifestations of critical thinking, we have tentatively concluded that to
define critical thinking as a series of discrete skills or steps is
misrepresentative in at least two ways. First, it suggests that critical
thinking is or should be linear, which it rarely if ever is or could be. As
one question is explored, for example, others may be raised or pursued while
the initial question is put aside temporarily or abandoned altogether.
Secondly, critical thinking is creative in ways analogous to writing poetry or
sculpting. Reducing critical thinking to a series of skills is like painting
by number. Such a conception of critical thinking suggests that the parts add
up to make the whole. Unlike jigsaw puzzles and most automobiles, critical
thinking cannot be taken apart and reassembled without damage.

We have come to understand critical thinking as a dynamic process of
questioning and reasoning that reflects what might be described as a trusting,
yet skeptical, orientation toward the world. It is active inquiry rather than
passive acceptance of tradition and authority or "common sense." To think
critically is to question public statements, beliefs, and definitions. It
involves considering what might have been and what may yet be.

That orientation to critical thinking has guided the development of the 
project's instructional goals and strategies and related assessment 
procedures. When we think of critical thinking skills, we look to skills that 
contribute to critical thinking, not skills which stand as separate things to 
be taught independently. Depending on how one prefers to think about skills,
critical thinking can be seen as involving a few or dozens of skills,
including, but not limited to, reading and writing skills, academic or study or
library research skills, and so-called higher level skills such as logical 
analysis, summarizing, classifying, inferring, and evaluating. 

An emphasis on oral and written expression is central, we think, to
instruction and measurement in critical thinking. Talking and writing strike
us as generative in the same sense that thinking is generative. When students
think, talk, and write, they make meaning; and when students focus on making
meaning in critical ways, they end up with extended discourse which can both
create and present critical thinking. The expressions of critical thought
(discussions, compositions, and other types of presentations) are ways of both
discovering and verbalizing questions, conclusions, and the paths by which
those conclusions were reached. The substance and quality of these
expressions provide evidence of more or less critical thought.

We found no existing tests that suited our particular conceptualization
of critical thinking or that met our practical needs. The vast majority of
tests that could be considered critical thinking tests, including the two most
popular critical thinking tests (the Watson-Glaser Critical Thinking Appraisal
and the Cornell Critical Thinking Test), measure discrete skills in a
multiple-choice format (see McPeck, 1982, for a detailed critical review).
The assessment of discrete skills did not fit with our understanding of skills
for critical thinking as interdependent and as components of an integrated and
frequently recursive process. Further, we did not find multiple-choice and
short-answer testing formats appropriate. While the results of a critical
thinking process can be expressed, at times, in phrases, statistics, and other
convenient shorthands that require little more than stating, listing, or
marking answers, the encouragement of such expression to the exclusion of
extended discourse has unfortunate pedagogical implications. Multiple-choice
and short-answer items, while simpler to use than essay items, provide almost
no insight into the process that generated the student's response and
consequently are not particularly useful for diagnostic purposes. A "correct"
answer might be the result of a lucky guess; an "incorrect" answer,
completely justifiable, were the student allowed to explain the reasons or
assumptions that led to that choice. Moreover, such testing formats imply
that a single correct or best answer exists. This implication is antithetical
to our goals in teaching critical thinking. The point is not whether students
can recognize or produce an answer that the teacher or some other authority
considers correct, but rather whether they can adequately explain, validate,
and defend the responses they have generated. Rarely, outside the somewhat
artificial world of the testing industry, do "correct" answers exist. We do
our students a disservice by not encouraging them to explain and support their
responses. Even critical thinking tasks that might result in something other
than language, like a statistic, still require quite extended language to be
intelligible and convincing to others.

We therefore decided on a testing procedure similar to that used by the
Third National Assessment of Reading and Literature (NAEP, 1981), where
students were asked to read prose passages or poems and then write essays to
explain their interpretations of what they had read. We chose to have
students read passages relevant to their social studies curriculum and then
write essays in response to questions which ask them to do such tasks as
evaluate or draw inferences from what they had read. To be sure, a product of
critical thought, such as an essay, does not provide an exact image of the
process. However, asking students to state their ideas clearly and justify or
defend those ideas with coherent explanations and evidence does provide
insight into the thinking process (while, at the same time, serving as a
vehicle for that process).

Unfortunately, the use of essay tests for district-wide testing on a
regular basis presents some practical problems. They are time-consuming to
develop and to read and difficult to score reliably or consistently. There is
also the danger that teachers will focus on the "surface" features, such as
mechanics and usage errors, which are easy to score, ignoring the features of
the essay that provide evidence of critical thought. To help alleviate these
problems, we developed an analytic scoring guide, to be described below, which
teachers can apply to their student essays relatively quickly to obtain
relevant diagnostic information.



The Test



The project goals or objectives that guide both testing and instruction 
are summarized below. Using social studies subject matter, students are 
expected to: 

- read (or, in the primary grades, listen to) selected passages;

- prepare for critical thinking, discussion, and/or writing by listing,
  diagramming, or outlining important parts, ideas, events, personages, or
  settings from given reading materials;

- engage in questioning and discussing what they have read (or heard),
  using complete sentences and providing explanations and evidence from
  their reading materials or other sources to support their statements;

- summarize and paraphrase (orally and in writing) what others have said,
  revising their own positions on the basis of new information or
  interpretations;

- write essays that reflect critical thought and that include the elements
  important to the coherent, persuasive communication of critical thought;

- revise and edit their essays in light of feedback from peers and
  teachers according to the specific criteria outlined in the Critical
  Thinking Diagnostic Essay Scoring Guide.

Having put aside the notion that lists of skills can define critical
thinking, we nevertheless found that an efficient way to help the test
developers (a "steering" committee of approximately 25 district teachers and
supervisors) generate essay questions and instructional materials was to
provide them with a framework from which to work. So we developed the
tentative list of what we call "skills that contribute to critical thinking,"
in the sense that they can prompt open-ended questions, thinking, writing, and
discussions. The list that appears below is our current "working" list,
subject to additions and deletions as the need arises. It was never intended
to be comprehensive (as we are convinced that is impossible) or taxonomical.
It could be much longer and perhaps more detailed, but our steering committee
advised us that, with a longer list, teachers using it might be overwhelmed
and "lose sight of the forest for the trees." The list simply provides a
framework from which teachers can build teaching and testing materials.

Skills that contribute to critical thinking include: 

- summary, where students compose

    descriptions of selected parts of reading materials and
      presentations;
    reconstructions of events or ideas in sequence from passages and
      presentations; and
    brief summaries of events and ideas.

- classification, where students compose

    comparisons, contrasts, and classifications based on reading
      materials and presentations;
    distinctions between ideas and supporting evidence; and
    distinctions between facts and other statements.

- inference, where students compose

    explanations and interpretations of events, ideas, and actions;
    conclusions and generalizations from events, ideas, and actions;
    predictions and hypotheses; and
    opinions and points of view.

- evaluation, where students compose

    judgments of the accuracy, consistency, and completeness of ideas
      and actions;
    judgments about the ethics or values of ideas and actions;
    judgments of the merit or worth of ideas and actions; and
    criteria for all and any judgments.

A test booklet consists of (a) a brief reading passage, customarily
written by members of our steering committee, which is selected based upon
its relevance to the social studies curriculum, the amount and accuracy of the
evidence it contains, and its inherent interest to students; (b) a set of
directions preceding the reading passage to help students focus on the aspects
of the passage that are relevant to the essay question; (c) the essay
question itself, which specifies what is expected of the students; and (d) a
lined space for writing, directly opposite the reading selection, so that
students may easily refer back to the passage as they write. A sample test
booklet, developed for students in grade 11, is presented in Figure 1.



Insert Figure 1 about here



We developed an analytic scoring guide that would provide specific
diagnostic information, relevant to instruction, that teachers could apply
relatively quickly and consistently, and that could be used to give feedback
to students. We began by examining the nature of essays that provide evidence
of students' critical thinking. What, in other words, we asked ourselves,
might an essay look like that adequately expressed a student's views, for
instance, on the evaluation of Thomas Paine's arguments in Common Sense? Were
there necessary elements or aspects of communicating critical thinking that
should surface?

After grappling with these questions and numerous versions of a scoring
guide, we decided on five elements which now comprise the first five of the
six categories on the scoring guide:

- topic statement, which makes a clear, general presentation of the
  student's main idea;

- evidence, which is accurate, factual information relevant to the
  topic statement;

- explanations, which elaborate on the topic statement or show how,
  why, and to what extent evidence cited supports the topic
  statement;

- concluding statement, which provides a summary statement that draws
  together ideas and evidence; and

- organization, which demonstrates a logical, coherent flow of ideas.

A sixth category, response to task, provides a focused holistic score
describing how well the student's essay demonstrates the "skill contributing
to critical thinking" on which the essay question was based.






The detailed version of the scoring guide from which the teachers work is
shown in Figure 2. The definitions which follow each of the six categories
describe "standards" of acceptable performance for that category. An essay
can receive one of four possible scores in each category:

- not present, indicating that the element or skill is missing from
  the essay;

- standard not met, indicating that the element or skill is present
  in the essay, but that it does not conform to the standard
  described in the guide;

- standard met, indicating that the element or skill is present in
  the essay and that it conforms to the standard described; or

- standard exceeded, indicating that the element or skill is present
  and that it surpasses the standard described.
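
The structure of a completed rating (six categories, each given one of four score levels) can be sketched in code for readers who want to tabulate scores. This is only an illustrative sketch, not part of the project's materials: the names Score, CATEGORIES, make_rating, and needs_attention are hypothetical, and the numeric value 0 for "not present" is an assumption (the sample ratings in this paper show only 1, 2, and 3).

```python
from enum import IntEnum


class Score(IntEnum):
    """Score levels on the guide. The value 0 for NOT_PRESENT is an assumption."""
    NOT_PRESENT = 0
    STANDARD_NOT_MET = 1
    STANDARD_MET = 2
    STANDARD_EXCEEDED = 3


# The six scoring categories, in the order the guide lists them.
CATEGORIES = (
    "topic statement",
    "evidence",
    "explanations",
    "concluding statement",
    "organization",
    "response to task",
)


def make_rating(scores):
    """Check a {category: score} mapping and return it with Score values."""
    missing = [c for c in CATEGORIES if c not in scores]
    unknown = [c for c in scores if c not in CATEGORIES]
    if missing or unknown:
        raise ValueError(f"missing: {missing}; unknown: {unknown}")
    # Score(...) raises ValueError for any value outside 0-3.
    return {c: Score(scores[c]) for c in CATEGORIES}


def needs_attention(rating):
    """List, in guide order, the categories scored below 'standard met'."""
    return [c for c in CATEGORIES if rating[c] < Score.STANDARD_MET]


# Ratings this paper reports for its first sample essay.
first_essay = make_rating({
    "topic statement": 3,
    "evidence": 1,
    "explanations": 1,
    "concluding statement": 2,
    "organization": 2,
    "response to task": 1,
})
print(needs_attention(first_essay))
```

A teacher preparing for a conference might use such a list to decide which categories to discuss first; the threshold of "standard met" is a choice made for this sketch, not a rule stated in the guide.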



Insert Figure 2 about here 



As part of the project's inservice component, teachers receive training
in the use of the scoring guide. Training materials include sets of sample
essays scored by consensus of the steering committee, the rationale for giving
those scores, and suggestions that the teacher might use in conferring with
individual students to help them improve the quality of their essays.
Teachers are given the opportunity to score a variety of essays, to discuss
their reasons for giving the scores they select, and to come to consensus with
us and among themselves about what the most appropriate score is for each
sample essay. (In addition, as a part of staff development, teachers receive
instruction in such techniques as questioning, discussion, and writing
activities that directly support their teaching of critical thinking.)

Here are two sample essays written in response to the question that asks
11th grade students to evaluate the arguments made in Thomas Paine's
Common Sense (from the sample test booklet presented earlier in this paper).
Each essay is followed by excerpts from the training materials which provide
the rationale for the scores given and suggestions for providing feedback to
the students who wrote the essays. [2]

Thomas Paine' s arguments are not strong enough to 
convince me to support the fight for independence from 
Britain. His facts are too general, and he does not 
thoroughly support his arguments. 

What Paine states is common sense. However, a 
fight for independence is a drastic step, and Paine 
gives no evidence that such a measure would be 
successful.

Paine makes statements such as "The American
economy would be even stronger today if no European
power had anything to do with it." This is only his
opinion, who's to know if he is correct or not?

Paine states his argument in such a nonchalant 
manner that I am not convinced that a move for 
independence is best. 

Topic Statement (3 = standard exceeded). The topic statement
provides the main idea and the position of the student. It also
indicates the reasons why the student has arrived at that decision,
i.e., the explicit criteria the student is using to evaluate the
argument.

The topic statement is fine.

Evidence (1 = standard not met). The student offers only one piece
of evidence from the text.

The student could be asked to identify the various reasons that
Paine gives for the fight for independence. Next to each reason
the student could make a judgment about whether or not the reason
is acceptable or accurate. Once this has been done, the student
might select those reasons or points of evidence that will be dealt
with in the composition.

Explanations (1 = standard not met). There is little evidence of
clear reasoning or explaining here. The use of "however" in the
second paragraph attempts to show a relationship between the first
sentence and the author's position, but it provides only the
surface evidence of a relationship rather than a clear statement of
a position (i.e., that Paine's facts are too general and that he
does not thoroughly support his arguments), and the student does not
deal with either of these points in the composition. The student
seems to be suggesting that people should fight for something only
if they know they will be successful.

[2] The scoring rationale and suggestions for revision are based upon
written comments prepared by Stephen M. Koziol, Jr., University of Pittsburgh.

The student could be asked to identify which of Paine's facts are
too general and not supported. Then the student could explain why
those facts are too general or unsupported and offer alternative
evidence, if possible. The student might draft that section of the
composition before deciding whether additional attention to
communicating explanations needs to be given. If the three or more
reasons are identified but the explanations are still not present,
the teacher could focus on having the student identify what the
relationship is between the thesis assertion and the reason, and
assist the student in selecting words or phrases to reflect that
relationship.

Concluding Statement (2 = standard met). The word "nonchalant"
reiterates the position taken in the thesis statement that Paine's
arguments are too general and unsupported. However, the body of
the composition has not dealt adequately with that thesis position.
Therefore, while it is a restatement, it does not draw together the
ideas and evidence of the composition in a complete or
thorough way.

It is likely that the problems with the conclusion will be
alleviated by additional attention to the evidence and the
reasoning. It would be helpful also if the student provided some
elaboration for the word "nonchalant." The teacher might ask the
student for synonyms for "nonchalant" and suggest that they be
included after "nonchalant" in the essay to clarify further just
what is meant; for example, "Paine states his argument in such
a nonchalant, casual, unsupported manner that ...".

Organization (2 = standard met). There is an organization here and
it is clear. Basically, each part of the composition is placed in
a separate paragraph and sequenced reasonably.

The student could also explore how to organize his/her ideas in
more fully developed paragraphs. How might paragraphs two and
three be joined together, for example? What else might have to be
done if this were all one paragraph? The focus here would be on 
helping the student recognize the need to include more relation and 
transition words and phrases when the ideas are compacted into a 
single paragraph. That compaction places greater emphasis on the 
development and support of the thesis idea. 

Response to Task (1 = standard not met). The topic statement is
the strongest indication that the student intends to respond
comprehensively to the task. The body of the composition fails to
meet the task demands either in the number of reasons dealt with or
in the adequacy of the treatment of ideas. The conclusion once
again suggests that the student was aware of the task demands.

As discussed under Evidence and Explanations above, the major need 
here is for a better and more complete selection of evidence and 
reasons and a more clearly identified relationship between the 
reasons selected and the thesis statement. 



I will support the fight for independence because
without our independence we will be a slave country.

There are several reasons why we need our
independence from Great Britain, one is that every
time Great Britain goes to war our American trade goes
to ruin because of our connection with Great Britain.
Also there will be no more suffering for us for what
Great Britain did. If we are free we will be able to
make trade with other countries.

I believe if we get our independence from Great
Britain there will be better happiness in our country.

Topic Statement (1 = standard not met). There is a clear
statement of the student's position as well as a hint about why the
student has taken that position (evident in the word "slave").
However, the notion of "slave" here seems out of place.

If "slave" is the idea that the student wants to communicate, then
the topic statement will need to be expanded to provide more
information about how not getting independence equates with being
enslaved. The student can be asked to explain what he/she means by
a "slave country" and asked to reflect about whether those
qualities are accurate for the British-Colonial relationship at the
time. The major problem here may be diction (word choice); asking
questions about the word "slave" could clarify the intent and lead
to better definition.

Evidence (1 = standard not met). Although there seem to be three
reasons identified, there are in reality only two. Without much
more explanation, there is little difference between reasons one
and three. The statement of the evidence is not clear. Just what
is the student referring to in reason two, where he or she says,
"there will be no more suffering for us..."? There are some
possible connections to what Paine says, but these connections are
not clear.

As with the first eleventh grade essay, above, the student needs a
much better handle on what the evidence is. The teacher might
proceed much as described in suggestions for revision of evidence
in the first essay; or, after a conference, the teacher
might ask for another draft composed as focused free writing that
might give the student more to work with.






Explanations (1 = standard not met). There is little evidence of
explanation here. There is some indication of sequential
reasoning, but the use of the "also" type of connector is inadequate
to the task.

The student could be asked to add an "if that were not true" or
"if we had independence" statement after each of the "three"
reasons given. The teacher might then suggest that inserting that
explanation after each reason would make the
explanation clearer and far more complete.

Concluding Statement (2 = standard met). There is a [illegible] of the
[illegible] in the conclusion. As with the topic statement, it
[illegible] position ("if we get our independence from
Great Britain") and the student's position ("there will be
better happiness in our country"). However, once again, the [illegible]
for that [illegible].

Clarification of the meaning and intent of "slave" in the thesis
[illegible] explanation of the evidence in
the body of the composition should make the conclusion more specific.
[illegible] the student might be asked to [illegible]
another draft of the conclusion [illegible] the initial conference with
the teacher.

Organization (1 = standard not met). The composition has an
[illegible]
the evidence supports that topic statement. [illegible]
Once again, following the suggestions above under
Evidence and Explanations might guide those changes.

As the essay tests are currently used by the district, they are in no
sense standardized tests. The test forms are not equivalent; different
reading passages and essay questions make the difficulty of the tests
variable, and hence not useful for comparative purposes. Although we try to
maintain as much consistency as possible in the use of the scoring guide, the
situation does not permit sufficient control to ensure its reliable use. A
random sample of papers is double-scored by the steering committee after each
test administration, and reliability, defined in terms of percent agreement
between the two scorers, is not as high as we would like. For the most recent
administration, percent of perfect agreement ranged from 47% to 50% across the
six scoring categories, and percent agreement within one score point ranged
from 91% to 95%. Another set of papers was scored by raters who were more
thoroughly trained and not otherwise connected with the project. Here,
percent of perfect agreement ranged from 57% to 68%, and percent agreement
within one score point ranged from 96% to 98%. This suggests that more
thorough training of raters who are not scoring their own students' essays
will produce more reliable scores. Since the scoring guide is presently used
primarily for diagnosis, for conferring with students, and for planning
instruction, we are not overly concerned with reliability at this time. While
the scoring guide could be used to monitor student achievement, it would have
to be applied under more controlled conditions (on equivalent test forms, with
centralized double or triple scoring of all papers).
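
The two agreement figures reported above (perfect agreement and agreement
within one score point) can be computed as in the following minimal sketch.
The function name, the tolerance parameter, and the sample scores are
illustrative assumptions; they are not the project's data or procedure.

```python
from typing import Sequence


def percent_agreement(first: Sequence[int], second: Sequence[int],
                      tolerance: int = 0) -> float:
    """Percent of papers on which two scorers differ by at most `tolerance` points."""
    if len(first) != len(second):
        raise ValueError("both scorers must rate the same set of papers")
    if not first:
        raise ValueError("at least one double-scored paper is required")
    agree = sum(1 for a, b in zip(first, second) if abs(a - b) <= tolerance)
    return 100.0 * agree / len(first)


# Hypothetical scores (0-3 scale) given by two scorers to eight papers
# in a single scoring category.
scorer_a = [2, 1, 3, 2, 0, 2, 1, 3]
scorer_b = [2, 2, 3, 1, 0, 2, 3, 3]

perfect = percent_agreement(scorer_a, scorer_b)                  # exact matches only
within_one = percent_agreement(scorer_a, scorer_b, tolerance=1)  # off by one point at most
print(f"perfect: {perfect:.1f}%  within one point: {within_one:.1f}%")
```

Running the same calculation separately for each of the six scoring
categories would give a range of values of the kind quoted in the text.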

Although we call our tests Critical Thinking Tests, we recognize that the
label is somewhat arbitrary. As yet, the construct, critical thinking, is a
hazy one, operating in a poorly defined nomological network. Not
surprisingly, the semantic confusion caused by diverse theoretical definitions
of critical thinking is reflected in the diverse operational definitions
represented by different tests. This lack of commonality among "critical
thinking" tests is further compounded when one compares test formats, which
can be oral, written, non-verbal, multiple-choice, short-answer, or essay
tests, all features which put different demands on the examinee. We developed
this test because nothing available suited our beliefs about how critical
thinking should be defined and measured and, at the same time, met the needs
of the district for a diagnostic test that would complement critical thinking
instruction without overburdening social studies teachers. A construct
validation study is planned for the 1983-84 academic year, the results of
which should be available by September 1984. We intend to explore the
relationship between student performance on these tests rated by the
Diagnostic Scoring Guide, and variables representing explainable rater bias,
test form difficulty, student achievement in reading, language arts, and
social studies (including prior knowledge relevant to the essay question), and
student performance on other available measures of critical thinking. This
information should increase our understanding and improve our interpretations
of scores students receive on MAP Critical Thinking Tests.






REFERENCES 



Cornbleth, C. Inquiry Theory and Social Studies Curricula: Problems in
Planning for Thinking. Paper presented at the American Educational
Research Association, Toronto, March 1978.

McPeck, J. E. Critical Thinking and Education. New York: St. Martin's
Press, 1981.

National Assessment of Educational Progress. Reading, thinking, and
writing: Results from the 1979-80 National Assessment of Reading and
Literature (Report No. 11-L-01). Denver, Colo.: Education Commission of
the States, 1981. (ERIC Document Reproduction Service No. ED 209 641)






Name 
Date 



Teacher 
School 



Grade & Class 




PITTSBURGH PUBLIC SCHOOLS
MONITORING ACHIEVEMENT IN PITTSBURGH 
(MAP) 

Critical Thinking Test 
Grade 11: Form 2 






Field Test Version, 1983-1984 School Year 



11-2-2 



DIRECTIONS: Imagine you are a colonist reading the following article by Thomas Paine.
Decide whether or not his arguments have convinced you to support the fight for
independence from Britain. Then write a composition in which you explain your
decision.






COMMON SENSE* 
by Thomas Paine 

I offer nothing more than simple facts, plain arguments, and
common sense.

Much has been written about the struggle between England
and America. People from all backgrounds have taken part in the
controversy, but all attempts to reach a compromise have failed.
The time to talk is past. War must now decide the question.

By becoming a matter of warfare instead of debate, the struggle
has begun a new era of politics. People will now have to think in
a new way. All the plans and proposals that were made before the
battles of Lexington and Concord last year are useless now. As much
has been said about the advantages of reaching a settlement with Great
Britain, we should now look at the other side of the argument. We
must examine the many injuries we have suffered and will continue to
suffer because of our connection with Great Britain.

Many people say that, since America has prospered under British
rule, we should not change that arrangement. Nothing could be more
incorrect than that kind of thinking. The American economy would be
much stronger today if no European power had anything to do with it.

People also think that we need Great Britain to protect us. They
believe that the British protect us because they like us. This is not
true. The British protect us because they need us as a market. They
protect us from their own enemies, France and Spain, not from ours.

Some say that Britain is our parent country. That is only partly
true. Europe, not just England, is the parent country of America.
The new world has been the asylum for the persecuted lovers of civil
and religious liberty from every part of Europe.

Whenever England goes to war, American trade goes to ruin, because
of our connection with Britain. It makes good sense for America to
separate itself from Britain. That we are an ocean away from them is
a strong argument for our going our own way.

There is something absurd in supposing it right for a continent to
be forever governed by a small island. It is in America's best political,
economic, and social interests to be a separate and independent state.
Anything short of independence is mere patchwork and cannot work in the
long run.




*[illegible]






11-2-3 



DIRECTIONS: Now imagine you are a colonist reading Thomas Paine's Common
Sense. Decide whether or not his arguments have convinced you to
support the fight for independence from Britain. Then write a
composition in which you explain your decision. Your composition
should include:

- a topic statement telling whether or not you would support the 
fight for independence; 

- the main points Thomas Paine makes in the article; 

- explanations of why these points would or would not convince you; and 

- a concluding statement that sums up your position. 



If you need more space, you may write on the back of this page. 




MAP CRITICAL THINKING PROJECT 

DIAGNOSTIC SCORING GUIDE 
DETAILED VERSION 



Topic Statement
    -makes clear, general presentation of main idea
    -may be one or more sentences
    -usually appears at beginning, but may be found in middle

Evidence
    -is information (facts or other citations) from reading
     passage or from student's general knowledge
    -is relevant to topic statement or main points
    -is accurate
    -is sufficient to support topic statement

Explanations
    -are elaborations of key points in topic statement
    -[illegible] such as [illegible] instruction, definitions of [illegible]

Concluding Statement
    -relates to topic statement
    -may paraphrase topic statement

Organization
    -presents logical, meaningful flow of ideas
    -arranges sentences and paragraphs to follow from each other

Response to Task
    -[illegible], classifying, inferring,
     or evaluating as requested in essay prompt


SCORE POINTS

0 = not present         The element or skill is missing from the
                        essay. Revision should be recommended.

1 = standard not met    The element or skill is present in the essay
                        but it does not conform to the standard as
                        defined. Revision should be recommended.

2 = standard met        The element or skill is present in the essay
                        and it conforms to the standard as defined.
                        Revision may be recommended.

3 = standard exceeded   The element or skill is present in the essay
                        and it exceeds the standard as defined. No
                        revision is needed.



Figure 2. Diagnostic Scoring Guide 
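
For readers who keep essay scores electronically, the guide in Figure 2 can
be expressed as a small lookup table, as in the sketch below. This is only
an illustration under stated assumptions: the names SCORE_POINTS, CATEGORIES,
and diagnostic_report and the sample scores are hypothetical, and only the
category names, score labels, and revision advice come from the guide itself.

```python
from typing import Dict, List, Tuple

# Score labels and revision advice as worded in the Diagnostic Scoring Guide.
SCORE_POINTS: Dict[int, Tuple[str, str]] = {
    0: ("not present", "Revision should be recommended."),
    1: ("standard not met", "Revision should be recommended."),
    2: ("standard met", "Revision may be recommended."),
    3: ("standard exceeded", "No revision is needed."),
}

# The six scoring categories, in the order the guide lists them.
CATEGORIES = (
    "Topic Statement",
    "Evidence",
    "Explanations",
    "Concluding Statement",
    "Organization",
    "Response to Task",
)


def diagnostic_report(scores: Dict[str, int]) -> List[str]:
    """Return one line per category: score, label, and revision advice."""
    missing = [c for c in CATEGORIES if c not in scores]
    if missing:
        raise ValueError("missing scores for: " + ", ".join(missing))
    report = []
    for category in CATEGORIES:
        score = scores[category]
        if score not in SCORE_POINTS:
            raise ValueError(f"{category}: score must be 0-3, got {score}")
        label, advice = SCORE_POINTS[score]
        report.append(f"{category}: {score} ({label}). {advice}")
    return report


# Hypothetical scores for a single essay (not taken from the paper).
example = {
    "Topic Statement": 2,
    "Evidence": 1,
    "Explanations": 1,
    "Concluding Statement": 0,
    "Organization": 1,
    "Response to Task": 2,
}
for line in diagnostic_report(example):
    print(line)
```

A report of this kind could serve as a starting point for a conference with
the student; it does not replace the scorer's judgment about which revisions
to suggest.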






